Inverse-gamma Distribution
   HOME

TheInfoList



OR:

In
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
and
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, the inverse gamma distribution is a two-parameter family of continuous
probability distribution In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon i ...
s on the positive
real line In elementary mathematics, a number line is a picture of a graduated straight line (geometry), line that serves as visual representation of the real numbers. Every point of a number line is assumed to correspond to a real number, and every real ...
, which is the distribution of the
reciprocal Reciprocal may refer to: In mathematics * Multiplicative inverse, in mathematics, the number 1/''x'', which multiplied by ''x'' gives the product 1, also known as a ''reciprocal'' * Reciprocal polynomial, a polynomial obtained from another pol ...
of a variable distributed according to the
gamma distribution In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-square distribution are special cases of the gamma distri ...
. Perhaps the chief use of the inverse gamma distribution is in
Bayesian statistics Bayesian statistics is a theory in the field of statistics based on the Bayesian interpretation of probability where probability expresses a ''degree of belief'' in an event. The degree of belief may be based on prior knowledge about the event, ...
, where the distribution arises as the marginal posterior distribution for the unknown
variance In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of numbers ...
of a
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
, if an
uninformative prior In Bayesian statistical inference, a prior probability distribution, often simply called the prior, of an uncertain quantity is the probability distribution that would express one's beliefs about this quantity before some evidence is taken into ...
is used, and as an analytically tractable
conjugate prior In Bayesian probability theory, if the posterior distribution p(\theta \mid x) is in the same probability distribution family as the prior probability distribution p(\theta), the prior and posterior are then called conjugate distributions, and th ...
, if an informative prior is required. It is common among some Bayesians to consider an alternative parametrization of the
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
in terms of the
precision Precision, precise or precisely may refer to: Science, and technology, and mathematics Mathematics and computing (general) * Accuracy and precision, measurement deviation from true value and its scatter * Significant figures, the number of digit ...
, defined as the reciprocal of the variance, which allows the gamma distribution to be used directly as a conjugate prior. Other Bayesians prefer to parametrize the inverse gamma distribution differently, as a
scaled inverse chi-squared distribution The scaled inverse chi-squared distribution is the distribution for ''x'' = 1/''s''2, where ''s''2 is a sample mean of the squares of ν independent normal random variables that have mean 0 and inverse variance 1/σ2 = τ2. The distribu ...
.


Characterization


Probability density function

The inverse gamma distribution's
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
is defined over the support x > 0 : f(x; \alpha, \beta) = \frac (1/x)^\exp\left(-\beta/x\right) with
shape parameter In probability theory and statistics, a shape parameter (also known as form parameter) is a kind of numerical parameter of a parametric family of probability distributionsEveritt B.S. (2002) Cambridge Dictionary of Statistics. 2nd Edition. CUP. ...
\alpha and
scale parameter In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution. Definition If a family o ...
\beta. Here \Gamma(\cdot) denotes the
gamma function In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except ...
. Unlike the
Gamma distribution In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-square distribution are special cases of the gamma distri ...
, which contains a somewhat similar exponential term, \beta is a scale parameter as the distribution function satisfies: : f(x; \alpha, \beta) = \frac


Cumulative distribution function

The
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ev ...
is the
regularized gamma function In mathematics, the upper and lower incomplete gamma functions are types of special functions which arise as solutions to various mathematical problems such as certain integrals. Their respective names stem from their integral definitions, whic ...
:F(x; \alpha, \beta) = \frac = Q\left(\alpha, \frac\right)\! where the numerator is the upper incomplete gamma function and the denominator is the
gamma function In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except ...
. Many math packages allow direct computation of Q, the regularized gamma function.


Moments

Provided that \alpha > n, the n-th moment of the inverse gamma distribution is given by :\mathrm ^n= \beta^n \frac = \frac.


Characteristic function

K_(\cdot) in the expression of the
characteristic function In mathematics, the term "characteristic function" can refer to any of several distinct concepts: * The indicator function of a subset, that is the function ::\mathbf_A\colon X \to \, :which for a given subset ''A'' of ''X'', has value 1 at points ...
is the modified
Bessel function Bessel functions, first defined by the mathematician Daniel Bernoulli and then generalized by Friedrich Bessel, are canonical solutions of Bessel's differential equation x^2 \frac + x \frac + \left(x^2 - \alpha^2 \right)y = 0 for an arbitrary ...
of the 2nd kind.


Properties

For \alpha>0 and \beta>0, : \mathbb ln(X)= \ln(\beta) - \psi(\alpha)\, and : \mathbb ^= \frac,\, The
information entropy In information theory, the entropy of a random variable is the average level of "information", "surprise", or "uncertainty" inherent to the variable's possible outcomes. Given a discrete random variable X, which takes values in the alphabet \ ...
is : \begin \operatorname(X) & = \operatorname \ln(p(X))\\ & = \operatorname\left \alpha \ln(\beta) + \ln(\Gamma(\alpha)) + (\alpha+1)\ln(X) + \frac\right\\ & = -\alpha \ln(\beta) + \ln(\Gamma(\alpha)) + (\alpha+1)\ln(\beta) - (\alpha+1)\psi(\alpha) + \alpha\\ & = \alpha + \ln(\beta\Gamma(\alpha)) - (\alpha+1)\psi(\alpha). \end where \psi(\alpha) is the
digamma function In mathematics, the digamma function is defined as the logarithmic derivative of the gamma function: :\psi(x)=\frac\ln\big(\Gamma(x)\big)=\frac\sim\ln-\frac. It is the first of the polygamma functions. It is strictly increasing and strict ...
. The Kullback-Leibler divergence of Inverse-Gamma(''αp'', ''βp'') from Inverse-Gamma(''αq'', β''q'') is the same as the KL-divergence of Gamma(''αp'', ''βp'') from Gamma(''αq'', ''βq''): D_(\alpha_p,\beta_p; \alpha_q, \beta_q) = \mathbb\left \log \frac\right= \mathbb\left \log \frac\right= \mathbb\left \log \frac\right where \rho, \pi are the pdfs of the Inverse-Gamma distributions and \rho_G, \pi_G are the pdfs of the Gamma distributions, Y is Gamma(''αp'', ''βp'') distributed. : \begin D_(\alpha_p,\beta_p; \alpha_q, \beta_q) = & (\alpha_p-\alpha_q) \psi(\alpha_p) - \log\Gamma(\alpha_p) + \log\Gamma(\alpha_q) + \alpha_q(\log \beta_p - \log \beta_q) + \alpha_p\frac. \end


Related distributions

* If X \sim \mbox(\alpha, \beta) then k X \sim \mbox(\alpha, k \beta) \,, for k > 0 * If X \sim \mbox(\alpha, \tfrac) then X \sim \mbox\chi^2(2 \alpha)\, (
inverse-chi-squared distribution In probability and statistics, the inverse-chi-squared distribution (or inverted-chi-square distributionBernardo, J.M.; Smith, A.F.M. (1993) ''Bayesian Theory'' ,Wiley (pages 119, 431) ) is a continuous probability distribution of a positive-val ...
) * If X \sim \mbox(\tfrac, \tfrac) then X \sim \mbox\chi^2(\alpha,\tfrac)\, (
scaled-inverse-chi-squared distribution The scaled inverse chi-squared distribution is the distribution for ''x'' = 1/''s''2, where ''s''2 is a sample mean of the squares of ν independent normal random variables that have mean 0 and inverse variance 1/σ2 = τ2. The distribu ...
) * If X \sim \textrm(\tfrac,\tfrac) then X \sim \textrm(0,c)\, (
Lévy distribution In probability theory and statistics, the Lévy distribution, named after Paul Lévy, is a continuous probability distribution for a non-negative random variable. In spectroscopy, this distribution, with frequency as the dependent variable, is k ...
) * If X \sim \textrm(1,c) then \tfrac \sim \textrm(c)\, (
Exponential distribution In probability theory and statistics, the exponential distribution is the probability distribution of the time between events in a Poisson point process, i.e., a process in which events occur continuously and independently at a constant average ...
) * If X \sim \mbox(\alpha, \beta)\, (
Gamma distribution In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-square distribution are special cases of the gamma distri ...
with ''rate'' parameter \beta) then \tfrac \sim \mbox(\alpha, \beta)\, (see derivation in the next paragraph for details) *Note that If X \sim \mbox(k, \theta) (Gamma distribution with scale parameter \theta ) then 1/X \sim \mbox(k, 1/\theta) * Inverse gamma distribution is a special case of type 5
Pearson distribution The Pearson distribution is a family of continuous probability distributions. It was first published by Karl Pearson in 1895 and subsequently extended by him in 1901 and 1916 in a series of articles on biostatistics. History The Pearson system ...
* A
multivariate Multivariate may refer to: In mathematics * Multivariable calculus * Multivariate function * Multivariate polynomial In computing * Multivariate cryptography * Multivariate division algorithm * Multivariate interpolation * Multivariate optical c ...
generalization of the inverse-gamma distribution is the
inverse-Wishart distribution In statistics, the inverse Wishart distribution, also called the inverted Wishart distribution, is a probability distribution defined on real-valued positive-definite matrices. In Bayesian statistics it is used as the conjugate prior for the co ...
. * For the distribution of a sum of independent inverted Gamma variables see Witkovsky (2001)


Derivation from Gamma distribution

Let X \sim \mbox(\alpha, \beta), and recall that the pdf of the
gamma distribution In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-square distribution are special cases of the gamma distri ...
is : f_(x) = \fracx^e^, x > 0. Note that \beta is the rate parameter from the perspective of the gamma distribution. Define the transformation Y = g(X) = \tfrac. Then, the pdf of Y is :\begin f_Y(y) &= f_X \left( g^(y) \right) \left, \frac g^(y) \ \\ pt&= \frac \left( \frac \right)^ \exp \left( \frac \right) \frac \\ pt&= \frac \left( \frac \right)^ \exp \left( \frac \right) \\ pt&= \frac \left( y \right)^ \exp \left( \frac \right) \\ pt\end Note that \beta is the scale parameter from the perspective of the inverse gamma distribution. This can be straightforwardly demonstrated by seeing that \beta satisfies the conditions for being a
scale parameter In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution. Definition If a family o ...
. :\begin \frac &= \frac \frac \left( \frac \right)^ \exp(-y) \\ pt&= \frac \left( y \right)^ \exp(-y) \\ pt&= f_(y) \end


Occurrence

*
Hitting time In the study of stochastic processes in mathematics, a hitting time (or first hit time) is the first time at which a given process "hits" a given subset of the state space. Exit times and return times are also examples of hitting times. Definitions ...
distribution of a
Wiener process In mathematics, the Wiener process is a real-valued continuous-time stochastic process named in honor of American mathematician Norbert Wiener for his investigations on the mathematical properties of the one-dimensional Brownian motion. It is o ...
follows a
Lévy distribution In probability theory and statistics, the Lévy distribution, named after Paul Lévy, is a continuous probability distribution for a non-negative random variable. In spectroscopy, this distribution, with frequency as the dependent variable, is k ...
, which is a special case of the inverse-gamma distribution with \alpha=0.5.


See also

*
Gamma distribution In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-square distribution are special cases of the gamma distri ...
*
Inverse-chi-squared distribution In probability and statistics, the inverse-chi-squared distribution (or inverted-chi-square distributionBernardo, J.M.; Smith, A.F.M. (1993) ''Bayesian Theory'' ,Wiley (pages 119, 431) ) is a continuous probability distribution of a positive-val ...
*
Normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
*
Pearson distribution The Pearson distribution is a family of continuous probability distributions. It was first published by Karl Pearson in 1895 and subsequently extended by him in 1901 and 1916 in a series of articles on biostatistics. History The Pearson system ...


References

*Hoff, P. (2009). "A first course in bayesian statistical methods". Springer. * {{DEFAULTSORT:Inverse-Gamma Distribution Continuous distributions Conjugate prior distributions Probability distributions with non-finite variance Exponential family distributions